Dataset statistics
| Dataset A | Dataset B | |
|---|---|---|
| Number of variables | 8 | 8 |
| Number of observations | 21229 | 10415 |
| Missing cells | 23254 | 11349 |
| Missing cells (%) | 13.7% | 13.6% |
| Duplicate rows | 770 | 227 |
| Duplicate rows (%) | 3.6% | 2.2% |
| Total size in memory | 1.5 MiB | 732.3 KiB |
| Average record size in memory | 72.0 B | 72.0 B |
Variable types
| Dataset A | Dataset B | |
|---|---|---|
| Categorical | 2 | 2 |
| Numeric | 6 | 6 |
| Dataset A | Dataset B | |
|---|---|---|
| Dataset has 770 (3.6%) duplicate rows | Dataset has 227 (2.2%) duplicate rows | Duplicates |
pressure is highly overall correlated with d_e and 2 other fields | pressure is highly overall correlated with d_e and 2 other fields | High Correlation |
d_e is highly overall correlated with pressure and 1 other fields | d_e is highly overall correlated with pressure and 1 other fields | High Correlation |
d_h is highly overall correlated with pressure and 4 other fields | d_h is highly overall correlated with pressure and 4 other fields | High Correlation |
length is highly overall correlated with d_h and 1 other fields | length is highly overall correlated with d_h | High Correlation |
author is highly overall correlated with d_h and 1 other fields | author is highly overall correlated with d_h and 1 other fields | High Correlation |
geometry is highly overall correlated with pressure and 3 other fields | geometry is highly overall correlated with pressure and 2 other fields | High Correlation |
author has 3403 (16.0%) missing values | author has 1621 (15.6%) missing values | Missing |
geometry has 3713 (17.5%) missing values | geometry has 1787 (17.2%) missing values | Missing |
pressure has 2986 (14.1%) missing values | pressure has 1466 (14.1%) missing values | Missing |
mass_flux has 3227 (15.2%) missing values | mass_flux has 1564 (15.0%) missing values | Missing |
d_e has 3641 (17.2%) missing values | d_e has 1847 (17.7%) missing values | Missing |
d_h has 3127 (14.7%) missing values | d_h has 1462 (14.0%) missing values | Missing |
length has 3157 (14.9%) missing values | length has 1602 (15.4%) missing values | Missing |
| Alert not present in | geometry is highly imbalanced (50.0%) | Imbalance |
Reproduction
| Dataset A | Dataset B | |
|---|---|---|
| Analysis started | 2023-05-31 22:43:57.180655 | 2023-05-31 22:44:01.684720 |
| Analysis finished | 2023-05-31 22:44:01.678589 | 2023-05-31 22:44:06.041947 |
| Duration | 4.5 seconds | 4.36 seconds |
| Software version | ydata-profiling v0.0.dev0 | ydata-profiling v0.0.dev0 |
| Download configuration | config.json | config.json |
author
Categorical
| Dataset A | Dataset B | |
|---|---|---|
| Distinct | 10 | 10 |
| Distinct (%) | 0.1% | 0.1% |
| Missing | 3403 | 1621 |
| Missing (%) | 16.0% | 15.6% |
| Memory size | 331.7 KiB | 162.7 KiB |
| Thompson | |
|---|---|
| Janssen | |
| Weatherhead | |
| Beus | 1087 |
| Peskov | 729 |
| Other values (5) |
| Thompson | |
|---|---|
| Janssen | |
| Weatherhead | |
| Beus | 517 |
| Peskov | 355 |
| Other values (5) |
Length
| Dataset A | Dataset B | |
|---|---|---|
| Max length | 12 | 12 |
| Median length | 8 | 8 |
| Mean length | 7.8993044 | 7.9031158 |
| Min length | 4 | 4 |
Characters and Unicode
| Dataset A | Dataset B | |
|---|---|---|
| Total characters | 140813 | 69500 |
| Distinct characters | 27 | 27 |
| Distinct categories | 2 | 2 ? |
| Distinct scripts | 1 | 1 ? |
| Distinct blocks | 1 | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Dataset A | Dataset B | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Dataset A | Dataset B | |
|---|---|---|
| 1st row | Thompson | Peskov |
| 2nd row | Thompson | Thompson |
| 3rd row | Thompson | Thompson |
| 4th row | Beus | Beus |
| 5th row | Thompson | Weatherhead |
Common Values
| Value | Count | Frequency (%) |
| Thompson | 11621 | |
| Janssen | 1846 | 8.7% |
| Weatherhead | 1377 | 6.5% |
| Beus | 1087 | 5.1% |
| Peskov | 729 | 3.4% |
| Williams | 567 | 2.7% |
| Richenderfer | 371 | 1.7% |
| Mortimore | 130 | 0.6% |
| Kossolapov | 70 | 0.3% |
| Inasaka | 28 | 0.1% |
| (Missing) | 3403 | 16.0% |
| Value | Count | Frequency (%) |
| Thompson | 5775 | |
| Janssen | 870 | 8.4% |
| Weatherhead | 663 | 6.4% |
| Beus | 517 | 5.0% |
| Peskov | 355 | 3.4% |
| Williams | 324 | 3.1% |
| Richenderfer | 174 | 1.7% |
| Mortimore | 67 | 0.6% |
| Kossolapov | 31 | 0.3% |
| Inasaka | 18 | 0.2% |
| (Missing) | 1621 | 15.6% |
Length
Histogram of lengths of the category
Common Values (Plot)
Dataset A
Dataset B
| Value | Count | Frequency (%) |
| thompson | 11621 | |
| janssen | 1846 | 10.4% |
| weatherhead | 1377 | 7.7% |
| beus | 1087 | 6.1% |
| peskov | 729 | 4.1% |
| williams | 567 | 3.2% |
| richenderfer | 371 | 2.1% |
| mortimore | 130 | 0.7% |
| kossolapov | 70 | 0.4% |
| inasaka | 28 | 0.2% |
| Value | Count | Frequency (%) |
| thompson | 5775 | |
| janssen | 870 | 9.9% |
| weatherhead | 663 | 7.5% |
| beus | 517 | 5.9% |
| peskov | 355 | 4.0% |
| williams | 324 | 3.7% |
| richenderfer | 174 | 2.0% |
| mortimore | 67 | 0.8% |
| kossolapov | 31 | 0.4% |
| inasaka | 18 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 24441 | |
| s | 17864 | |
| n | 15712 | |
| h | 14746 | |
| m | 12318 | |
| p | 11691 | |
| T | 11621 | |
| e | 9036 | 6.4% |
| a | 5321 | 3.8% |
| r | 2379 | 1.7% |
| Other values (17) | 15684 |
| Value | Count | Frequency (%) |
| o | 12132 | |
| s | 8791 | |
| n | 7707 | |
| h | 7275 | |
| m | 6166 | |
| p | 5806 | |
| T | 5775 | |
| e | 4320 | 6.2% |
| a | 2605 | 3.7% |
| r | 1145 | 1.6% |
| Other values (17) | 7778 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 122987 | |
| Uppercase Letter | 17826 | 12.7% |
| Value | Count | Frequency (%) |
| Lowercase Letter | 60706 | |
| Uppercase Letter | 8794 | 12.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 24441 | |
| s | 17864 | |
| n | 15712 | |
| h | 14746 | |
| m | 12318 | |
| p | 11691 | |
| e | 9036 | 7.3% |
| a | 5321 | 4.3% |
| r | 2379 | 1.9% |
| d | 1748 | 1.4% |
| Other values (8) | 7731 | 6.3% |
| Value | Count | Frequency (%) |
| o | 12132 | |
| s | 8791 | |
| n | 7707 | |
| h | 7275 | |
| m | 6166 | |
| p | 5806 | |
| e | 4320 | 7.1% |
| a | 2605 | 4.3% |
| r | 1145 | 1.9% |
| i | 889 | 1.5% |
| Other values (8) | 3870 | 6.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 11621 | |
| W | 1944 | 10.9% |
| J | 1846 | 10.4% |
| B | 1087 | 6.1% |
| P | 729 | 4.1% |
| R | 371 | 2.1% |
| M | 130 | 0.7% |
| K | 70 | 0.4% |
| I | 28 | 0.2% |
| Value | Count | Frequency (%) |
| T | 5775 | |
| W | 987 | 11.2% |
| J | 870 | 9.9% |
| B | 517 | 5.9% |
| P | 355 | 4.0% |
| R | 174 | 2.0% |
| M | 67 | 0.8% |
| K | 31 | 0.4% |
| I | 18 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 140813 |
| Value | Count | Frequency (%) |
| Latin | 69500 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 24441 | |
| s | 17864 | |
| n | 15712 | |
| h | 14746 | |
| m | 12318 | |
| p | 11691 | |
| T | 11621 | |
| e | 9036 | 6.4% |
| a | 5321 | 3.8% |
| r | 2379 | 1.7% |
| Other values (17) | 15684 |
| Value | Count | Frequency (%) |
| o | 12132 | |
| s | 8791 | |
| n | 7707 | |
| h | 7275 | |
| m | 6166 | |
| p | 5806 | |
| T | 5775 | |
| e | 4320 | 6.2% |
| a | 2605 | 3.7% |
| r | 1145 | 1.6% |
| Other values (17) | 7778 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 140813 |
| Value | Count | Frequency (%) |
| ASCII | 69500 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 24441 | |
| s | 17864 | |
| n | 15712 | |
| h | 14746 | |
| m | 12318 | |
| p | 11691 | |
| T | 11621 | |
| e | 9036 | 6.4% |
| a | 5321 | 3.8% |
| r | 2379 | 1.7% |
| Other values (17) | 15684 |
| Value | Count | Frequency (%) |
| o | 12132 | |
| s | 8791 | |
| n | 7707 | |
| h | 7275 | |
| m | 6166 | |
| p | 5806 | |
| T | 5775 | |
| e | 4320 | 6.2% |
| a | 2605 | 3.7% |
| r | 1145 | 1.6% |
| Other values (17) | 7778 |
geometry
Categorical
| Dataset A | Dataset B | |
|---|---|---|
| Distinct | 3 | 3 |
| Distinct (%) | < 0.1% | < 0.1% |
| Missing | 3713 | 1787 |
| Missing (%) | 17.5% | 17.2% |
| Memory size | 331.7 KiB | 162.7 KiB |
| tube | |
|---|---|
| annulus | |
| plate | 424 |
| tube | |
|---|---|
| annulus | |
| plate | 194 |
Length
| Dataset A | Dataset B | |
|---|---|---|
| Max length | 7 | 7 |
| Median length | 4 | 4 |
| Mean length | 4.5330555 | 4.5127492 |
| Min length | 4 | 4 |
Characters and Unicode
| Dataset A | Dataset B | |
|---|---|---|
| Total characters | 79401 | 38936 |
| Distinct characters | 9 | 9 |
| Distinct categories | 1 | 1 ? |
| Distinct scripts | 1 | 1 ? |
| Distinct blocks | 1 | 1 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Dataset A | Dataset B | |
|---|---|---|
| Unique | 0 | 0 ? |
| Unique (%) | 0.0% | 0.0% |
Sample
| Dataset A | Dataset B | |
|---|---|---|
| 1st row | tube | tube |
| 2nd row | tube | tube |
| 3rd row | annulus | tube |
| 4th row | tube | annulus |
| 5th row | tube | tube |
Common Values
| Value | Count | Frequency (%) |
| tube | 14121 | |
| annulus | 2971 | 14.0% |
| plate | 424 | 2.0% |
| (Missing) | 3713 | 17.5% |
| Value | Count | Frequency (%) |
| tube | 7024 | |
| annulus | 1410 | 13.5% |
| plate | 194 | 1.9% |
| (Missing) | 1787 | 17.2% |
Length
Histogram of lengths of the category
Common Values (Plot)
Dataset A
Dataset B
| Value | Count | Frequency (%) |
| tube | 14121 | |
| annulus | 2971 | 17.0% |
| plate | 424 | 2.4% |
| Value | Count | Frequency (%) |
| tube | 7024 | |
| annulus | 1410 | 16.3% |
| plate | 194 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| u | 20063 | |
| t | 14545 | |
| e | 14545 | |
| b | 14121 | |
| n | 5942 | 7.5% |
| a | 3395 | 4.3% |
| l | 3395 | 4.3% |
| s | 2971 | 3.7% |
| p | 424 | 0.5% |
| Value | Count | Frequency (%) |
| u | 9844 | |
| t | 7218 | |
| e | 7218 | |
| b | 7024 | |
| n | 2820 | 7.2% |
| a | 1604 | 4.1% |
| l | 1604 | 4.1% |
| s | 1410 | 3.6% |
| p | 194 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 79401 |
| Value | Count | Frequency (%) |
| Lowercase Letter | 38936 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 20063 | |
| t | 14545 | |
| e | 14545 | |
| b | 14121 | |
| n | 5942 | 7.5% |
| a | 3395 | 4.3% |
| l | 3395 | 4.3% |
| s | 2971 | 3.7% |
| p | 424 | 0.5% |
| Value | Count | Frequency (%) |
| u | 9844 | |
| t | 7218 | |
| e | 7218 | |
| b | 7024 | |
| n | 2820 | 7.2% |
| a | 1604 | 4.1% |
| l | 1604 | 4.1% |
| s | 1410 | 3.6% |
| p | 194 | 0.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 79401 |
| Value | Count | Frequency (%) |
| Latin | 38936 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 20063 | |
| t | 14545 | |
| e | 14545 | |
| b | 14121 | |
| n | 5942 | 7.5% |
| a | 3395 | 4.3% |
| l | 3395 | 4.3% |
| s | 2971 | 3.7% |
| p | 424 | 0.5% |
| Value | Count | Frequency (%) |
| u | 9844 | |
| t | 7218 | |
| e | 7218 | |
| b | 7024 | |
| n | 2820 | 7.2% |
| a | 1604 | 4.1% |
| l | 1604 | 4.1% |
| s | 1410 | 3.6% |
| p | 194 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 79401 |
| Value | Count | Frequency (%) |
| ASCII | 38936 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| u | 20063 | |
| t | 14545 | |
| e | 14545 | |
| b | 14121 | |
| n | 5942 | 7.5% |
| a | 3395 | 4.3% |
| l | 3395 | 4.3% |
| s | 2971 | 3.7% |
| p | 424 | 0.5% |
| Value | Count | Frequency (%) |
| u | 9844 | |
| t | 7218 | |
| e | 7218 | |
| b | 7024 | |
| n | 2820 | 7.2% |
| a | 1604 | 4.1% |
| l | 1604 | 4.1% |
| s | 1410 | 3.6% |
| p | 194 | 0.5% |
pressure
Real number (ℝ)
| Dataset A | Dataset B | |
|---|---|---|
| Distinct | 140 | 118 |
| Distinct (%) | 0.8% | 1.3% |
| Missing | 2986 | 1466 |
| Missing (%) | 14.1% | 14.1% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 10.635066 | 10.65233 |
| Dataset A | Dataset B | |
|---|---|---|
| Minimum | 0.1 | 0.1 |
| Maximum | 20.68 | 20.68 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 331.7 KiB | 162.7 KiB |
Quantile statistics
| Dataset A | Dataset B | |
|---|---|---|
| Minimum | 0.1 | 0.1 |
| 5-th percentile | 3.45 | 3.45 |
| Q1 | 6.89 | 6.89 |
| median | 11.03 | 11.07 |
| Q3 | 13.79 | 13.79 |
| 95-th percentile | 17.24 | 17.24 |
| Maximum | 20.68 | 20.68 |
| Range | 20.58 | 20.58 |
| Interquartile range (IQR) | 6.9 | 6.9 |
Descriptive statistics
| Dataset A | Dataset B | |
|---|---|---|
| Standard deviation | 4.3329433 | 4.3354087 |
| Coefficient of variation (CV) | 0.40742046 | 0.40699159 |
| Kurtosis | -0.55906368 | -0.55938547 |
| Mean | 10.635066 | 10.65233 |
| Median Absolute Deviation (MAD) | 2.76 | 2.8 |
| Skewness | -0.34582278 | -0.35270669 |
| Sum | 194015.5 | 95327.7 |
| Variance | 18.774397 | 18.795768 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 13.79 | 6179 | |
| 6.89 | 3145 | |
| 15.51 | 743 | 3.5% |
| 10.34 | 719 | 3.4% |
| 11.03 | 598 | 2.8% |
| 3.45 | 409 | 1.9% |
| 6.86 | 405 | 1.9% |
| 12.07 | 355 | 1.7% |
| 17.24 | 348 | 1.6% |
| 18.96 | 332 | 1.6% |
| Other values (130) | 5010 | |
| (Missing) | 2986 |
| Value | Count | Frequency (%) |
| 13.79 | 3047 | |
| 6.89 | 1556 | |
| 15.51 | 374 | 3.6% |
| 10.34 | 345 | 3.3% |
| 11.03 | 274 | 2.6% |
| 3.45 | 230 | 2.2% |
| 6.86 | 196 | 1.9% |
| 0.1 | 178 | 1.7% |
| 12.07 | 175 | 1.7% |
| 18.96 | 167 | 1.6% |
| Other values (108) | 2407 | |
| (Missing) | 1466 |
| Value | Count | Frequency (%) |
| 0.1 | 315 | |
| 0.2 | 83 | 0.4% |
| 0.3 | 1 | < 0.1% |
| 0.31 | 6 | < 0.1% |
| 0.33 | 5 | < 0.1% |
| 0.34 | 2 | < 0.1% |
| 0.36 | 1 | < 0.1% |
| 0.39 | 6 | < 0.1% |
| 0.51 | 98 | 0.5% |
| 0.62 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 0.1 | 178 | |
| 0.2 | 34 | 0.3% |
| 0.3 | 1 | < 0.1% |
| 0.31 | 3 | < 0.1% |
| 0.33 | 3 | < 0.1% |
| 0.39 | 7 | 0.1% |
| 0.51 | 38 | 0.4% |
| 0.62 | 3 | < 0.1% |
| 0.64 | 5 | < 0.1% |
| 0.91 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0.1 | 178 | |
| 0.2 | 34 | 0.2% |
| 0.3 | 1 | < 0.1% |
| 0.31 | 3 | < 0.1% |
| 0.33 | 3 | < 0.1% |
| 0.39 | 7 | < 0.1% |
| 0.51 | 38 | 0.2% |
| 0.62 | 3 | < 0.1% |
| 0.64 | 5 | < 0.1% |
| 0.91 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0.1 | 315 | |
| 0.2 | 83 | 0.8% |
| 0.3 | 1 | < 0.1% |
| 0.31 | 6 | 0.1% |
| 0.33 | 5 | < 0.1% |
| 0.34 | 2 | < 0.1% |
| 0.36 | 1 | < 0.1% |
| 0.39 | 6 | 0.1% |
| 0.51 | 98 | 0.9% |
| 0.62 | 6 | 0.1% |
mass_flux
Real number (ℝ)
| Dataset A | Dataset B | |
|---|---|---|
| Distinct | 689 | 619 |
| Distinct (%) | 3.8% | 7.0% |
| Missing | 3227 | 1564 |
| Missing (%) | 15.2% | 15.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 3070.4878 | 3062.9736 |
| Dataset A | Dataset B | |
|---|---|---|
| Minimum | 0 | 0 |
| Maximum | 7975 | 7975 |
| Zeros | 5 | 4 |
| Zeros (%) | < 0.1% | < 0.1% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 331.7 KiB | 162.7 KiB |
Quantile statistics
| Dataset A | Dataset B | |
|---|---|---|
| Minimum | 0 | 0 |
| 5-th percentile | 833 | 783 |
| Q1 | 1505 | 1519 |
| median | 2730 | 2740 |
| Q3 | 4069 | 4069 |
| 95-th percentile | 6347 | 6225.5 |
| Maximum | 7975 | 7975 |
| Range | 7975 | 7975 |
| Interquartile range (IQR) | 2564 | 2550 |
Descriptive statistics
| Dataset A | Dataset B | |
|---|---|---|
| Standard deviation | 1784.8731 | 1761.0661 |
| Coefficient of variation (CV) | 0.58129954 | 0.57495309 |
| Kurtosis | -0.15296149 | -0.18679136 |
| Mean | 3070.4878 | 3062.9736 |
| Median Absolute Deviation (MAD) | 1326 | 1316 |
| Skewness | 0.72138393 | 0.69296372 |
| Sum | 55274921 | 27110379 |
| Variance | 3185772.1 | 3101353.8 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 4069 | 623 | 2.9% |
| 1519 | 437 | 2.1% |
| 1356 | 411 | 1.9% |
| 2034 | 341 | 1.6% |
| 1000 | 291 | 1.4% |
| 1383 | 181 | 0.9% |
| 3811 | 166 | 0.8% |
| 2292 | 164 | 0.8% |
| 1533 | 160 | 0.8% |
| 4055 | 158 | 0.7% |
| Other values (679) | 15070 | |
| (Missing) | 3227 | 15.2% |
| Value | Count | Frequency (%) |
| 4069 | 340 | 3.3% |
| 1356 | 204 | 2.0% |
| 1519 | 197 | 1.9% |
| 2034 | 192 | 1.8% |
| 1000 | 127 | 1.2% |
| 4096 | 101 | 1.0% |
| 1383 | 85 | 0.8% |
| 2292 | 83 | 0.8% |
| 3784 | 83 | 0.8% |
| 3838 | 81 | 0.8% |
| Other values (609) | 7358 | |
| (Missing) | 1564 | 15.0% |
| Value | Count | Frequency (%) |
| 0 | 5 | < 0.1% |
| 4 | 4 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 310 | 10 | |
| 332 | 2 | < 0.1% |
| 336 | 9 | |
| 339 | 8 | |
| 340 | 19 |
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 8 | 1 | < 0.1% |
| 82 | 1 | < 0.1% |
| 148 | 1 | < 0.1% |
| 310 | 7 | |
| 332 | 2 | < 0.1% |
| 336 | 6 | |
| 339 | 2 | < 0.1% |
| 340 | 6 | |
| 346 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 8 | 1 | < 0.1% |
| 82 | 1 | < 0.1% |
| 148 | 1 | < 0.1% |
| 310 | 7 | |
| 332 | 2 | < 0.1% |
| 336 | 6 | |
| 339 | 2 | < 0.1% |
| 340 | 6 | |
| 346 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 5 | < 0.1% |
| 4 | 4 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 310 | 10 | |
| 332 | 2 | < 0.1% |
| 336 | 9 | |
| 339 | 8 | |
| 340 | 19 |
d_e
Real number (ℝ)
| Dataset A | Dataset B | |
|---|---|---|
| Distinct | 41 | 37 |
| Distinct (%) | 0.2% | 0.4% |
| Missing | 3641 | 1847 |
| Missing (%) | 17.2% | 17.7% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 8.5893052 | 8.7112628 |
| Dataset A | Dataset B | |
|---|---|---|
| Minimum | 1 | 1 |
| Maximum | 37.5 | 37.5 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 331.7 KiB | 162.7 KiB |
Quantile statistics
| Dataset A | Dataset B | |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 1.9 | 1.9 |
| Q1 | 4.7 | 5 |
| median | 7.8 | 8.5 |
| Q3 | 10.8 | 10.8 |
| 95-th percentile | 15 | 15 |
| Maximum | 37.5 | 37.5 |
| Range | 36.5 | 36.5 |
| Interquartile range (IQR) | 6.1 | 5.8 |
Descriptive statistics
| Dataset A | Dataset B | |
|---|---|---|
| Standard deviation | 5.1322071 | 5.2931496 |
| Coefficient of variation (CV) | 0.59751132 | 0.60762138 |
| Kurtosis | 9.1373713 | 8.9280928 |
| Mean | 8.5893052 | 8.7112628 |
| Median Absolute Deviation (MAD) | 3 | 2.9 |
| Skewness | 2.1707251 | 2.1929428 |
| Sum | 151068.7 | 74638.1 |
| Variance | 26.33955 | 28.017432 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=41)
| Value | Count | Frequency (%) |
| 10.3 | 1790 | 8.4% |
| 1.9 | 1692 | 8.0% |
| 10.8 | 1686 | 7.9% |
| 4.7 | 1637 | 7.7% |
| 7.7 | 1552 | 7.3% |
| 5.6 | 1417 | 6.7% |
| 12.7 | 963 | 4.5% |
| 7.8 | 872 | 4.1% |
| 10 | 740 | 3.5% |
| 9.5 | 564 | 2.7% |
| Other values (31) | 4675 | |
| (Missing) | 3641 |
| Value | Count | Frequency (%) |
| 10.3 | 894 | 8.6% |
| 10.8 | 826 | 7.9% |
| 1.9 | 807 | 7.7% |
| 4.7 | 795 | 7.6% |
| 7.7 | 742 | 7.1% |
| 5.6 | 691 | 6.6% |
| 12.7 | 470 | 4.5% |
| 7.8 | 411 | 3.9% |
| 10 | 340 | 3.3% |
| 9.5 | 317 | 3.0% |
| Other values (27) | 2275 | |
| (Missing) | 1847 |
| Value | Count | Frequency (%) |
| 1 | 93 | 0.4% |
| 1.1 | 12 | 0.1% |
| 1.7 | 6 | < 0.1% |
| 1.9 | 1692 | |
| 3 | 325 | 1.5% |
| 3.6 | 188 | 0.9% |
| 4.6 | 457 | 2.2% |
| 4.7 | 1637 | |
| 5 | 130 | 0.6% |
| 5.6 | 1417 |
| Value | Count | Frequency (%) |
| 1 | 60 | 0.6% |
| 1.1 | 6 | 0.1% |
| 1.7 | 2 | < 0.1% |
| 1.9 | 807 | |
| 3 | 158 | 1.5% |
| 3.6 | 93 | 0.9% |
| 4.6 | 201 | 1.9% |
| 4.7 | 795 | |
| 5 | 59 | 0.6% |
| 5.6 | 691 |
| Value | Count | Frequency (%) |
| 1 | 60 | 0.3% |
| 1.1 | 6 | < 0.1% |
| 1.7 | 2 | < 0.1% |
| 1.9 | 807 | |
| 3 | 158 | 0.7% |
| 3.6 | 93 | 0.4% |
| 4.6 | 201 | 0.9% |
| 4.7 | 795 | |
| 5 | 59 | 0.3% |
| 5.6 | 691 |
| Value | Count | Frequency (%) |
| 1 | 93 | 0.9% |
| 1.1 | 12 | 0.1% |
| 1.7 | 6 | 0.1% |
| 1.9 | 1692 | |
| 3 | 325 | 3.1% |
| 3.6 | 188 | 1.8% |
| 4.6 | 457 | 4.4% |
| 4.7 | 1637 | |
| 5 | 130 | 1.2% |
| 5.6 | 1417 |
d_h
Real number (ℝ)
| Dataset A | Dataset B | |
|---|---|---|
| Distinct | 47 | 44 |
| Distinct (%) | 0.3% | 0.5% |
| Missing | 3127 | 1462 |
| Missing (%) | 14.7% | 14.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 14.215446 | 14.091198 |
| Dataset A | Dataset B | |
|---|---|---|
| Minimum | 1 | 1 |
| Maximum | 120 | 120 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 331.7 KiB | 162.7 KiB |
Quantile statistics
| Dataset A | Dataset B | |
|---|---|---|
| Minimum | 1 | 1 |
| 5-th percentile | 1.9 | 1.9 |
| Q1 | 5.6 | 5.6 |
| median | 10 | 10 |
| Q3 | 11.5 | 11.5 |
| 95-th percentile | 42.3 | 42.3 |
| Maximum | 120 | 120 |
| Range | 119 | 119 |
| Interquartile range (IQR) | 5.9 | 5.9 |
Descriptive statistics
| Dataset A | Dataset B | |
|---|---|---|
| Standard deviation | 19.913594 | 19.686604 |
| Coefficient of variation (CV) | 1.400842 | 1.3970852 |
| Kurtosis | 18.283387 | 18.705588 |
| Mean | 14.215446 | 14.091198 |
| Median Absolute Deviation (MAD) | 4.3 | 3.3 |
| Skewness | 4.1188611 | 4.1555643 |
| Sum | 257328 | 126158.5 |
| Variance | 396.55123 | 387.56239 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=47)
| Value | Count | Frequency (%) |
| 10.3 | 1869 | 8.8% |
| 10.8 | 1735 | 8.2% |
| 1.9 | 1727 | 8.1% |
| 4.7 | 1701 | 8.0% |
| 7.7 | 1587 | 7.5% |
| 15.2 | 1061 | 5.0% |
| 7.8 | 885 | 4.2% |
| 10 | 753 | 3.5% |
| 42.3 | 583 | 2.7% |
| 9.5 | 575 | 2.7% |
| Other values (37) | 5626 | |
| (Missing) | 3127 |
| Value | Count | Frequency (%) |
| 10.3 | 923 | 8.9% |
| 10.8 | 863 | 8.3% |
| 4.7 | 853 | 8.2% |
| 1.9 | 842 | 8.1% |
| 7.7 | 795 | 7.6% |
| 15.2 | 517 | 5.0% |
| 7.8 | 428 | 4.1% |
| 10 | 362 | 3.5% |
| 9.5 | 329 | 3.2% |
| 42.3 | 281 | 2.7% |
| Other values (34) | 2760 | |
| (Missing) | 1462 |
| Value | Count | Frequency (%) |
| 1 | 100 | 0.5% |
| 1.1 | 11 | 0.1% |
| 1.7 | 7 | < 0.1% |
| 1.9 | 1727 | |
| 3 | 326 | 1.5% |
| 3.6 | 192 | 0.9% |
| 4.6 | 397 | 1.9% |
| 4.7 | 1701 | |
| 5.6 | 377 | 1.8% |
| 5.7 | 283 | 1.3% |
| Value | Count | Frequency (%) |
| 1 | 65 | 0.6% |
| 1.1 | 6 | 0.1% |
| 1.7 | 2 | < 0.1% |
| 1.9 | 842 | |
| 3 | 166 | 1.6% |
| 3.6 | 93 | 0.9% |
| 4.6 | 207 | 2.0% |
| 4.7 | 853 | |
| 5.6 | 193 | 1.9% |
| 5.7 | 117 | 1.1% |
| Value | Count | Frequency (%) |
| 1 | 65 | 0.3% |
| 1.1 | 6 | < 0.1% |
| 1.7 | 2 | < 0.1% |
| 1.9 | 842 | |
| 3 | 166 | 0.8% |
| 3.6 | 93 | 0.4% |
| 4.6 | 207 | 1.0% |
| 4.7 | 853 | |
| 5.6 | 193 | 0.9% |
| 5.7 | 117 | 0.6% |
| Value | Count | Frequency (%) |
| 1 | 100 | 1.0% |
| 1.1 | 11 | 0.1% |
| 1.7 | 7 | 0.1% |
| 1.9 | 1727 | |
| 3 | 326 | 3.1% |
| 3.6 | 192 | 1.8% |
| 4.6 | 397 | 3.8% |
| 4.7 | 1701 | |
| 5.6 | 377 | 3.6% |
| 5.7 | 283 | 2.7% |
length
Real number (ℝ)
| Dataset A | Dataset B | |
|---|---|---|
| Distinct | 65 | 60 |
| Distinct (%) | 0.4% | 0.7% |
| Missing | 3157 | 1602 |
| Missing (%) | 14.9% | 15.4% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 830.56496 | 837.95484 |
| Dataset A | Dataset B | |
|---|---|---|
| Minimum | 10 | 10 |
| Maximum | 3048 | 3048 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 331.7 KiB | 162.7 KiB |
Quantile statistics
| Dataset A | Dataset B | |
|---|---|---|
| Minimum | 10 | 10 |
| 5-th percentile | 100 | 100 |
| Q1 | 318 | 318 |
| median | 610 | 610 |
| Q3 | 914 | 914 |
| 95-th percentile | 2134 | 2134 |
| Maximum | 3048 | 3048 |
| Range | 3038 | 3038 |
| Interquartile range (IQR) | 596 | 596 |
Descriptive statistics
| Dataset A | Dataset B | |
|---|---|---|
| Standard deviation | 671.14217 | 674.67667 |
| Coefficient of variation (CV) | 0.808055 | 0.80514681 |
| Kurtosis | -0.10396928 | -0.12321172 |
| Mean | 830.56496 | 837.95484 |
| Median Absolute Deviation (MAD) | 292 | 292 |
| Skewness | 1.0317481 | 1.0169489 |
| Sum | 15009970 | 7384896 |
| Variance | 450431.81 | 455188.6 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 457 | 2140 | 10.1% |
| 762 | 1575 | 7.4% |
| 318 | 1456 | 6.9% |
| 2134 | 1252 | 5.9% |
| 152 | 1227 | 5.8% |
| 432 | 1013 | 4.8% |
| 591 | 897 | 4.2% |
| 1778 | 792 | 3.7% |
| 914 | 673 | 3.2% |
| 864 | 617 | 2.9% |
| Other values (55) | 6430 | |
| (Missing) | 3157 |
| Value | Count | Frequency (%) |
| 457 | 1040 | 10.0% |
| 762 | 789 | 7.6% |
| 318 | 698 | 6.7% |
| 2134 | 581 | 5.6% |
| 152 | 573 | 5.5% |
| 432 | 499 | 4.8% |
| 591 | 405 | 3.9% |
| 1778 | 353 | 3.4% |
| 914 | 343 | 3.3% |
| 1836 | 331 | 3.2% |
| Other values (50) | 3201 | |
| (Missing) | 1602 |
| Value | Count | Frequency (%) |
| 10 | 479 | |
| 12 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 25 | 34 | 0.2% |
| 35 | 75 | 0.4% |
| 38 | 33 | 0.2% |
| 43 | 5 | < 0.1% |
| 51 | 42 | 0.2% |
| 64 | 15 | 0.1% |
| 76 | 187 | 0.9% |
| Value | Count | Frequency (%) |
| 10 | 230 | |
| 25 | 22 | 0.2% |
| 35 | 25 | 0.2% |
| 38 | 23 | 0.2% |
| 43 | 2 | < 0.1% |
| 51 | 23 | 0.2% |
| 64 | 7 | 0.1% |
| 76 | 95 | |
| 96 | 1 | < 0.1% |
| 100 | 21 | 0.2% |
| Value | Count | Frequency (%) |
| 10 | 230 | |
| 25 | 22 | 0.1% |
| 35 | 25 | 0.1% |
| 38 | 23 | 0.1% |
| 43 | 2 | < 0.1% |
| 51 | 23 | 0.1% |
| 64 | 7 | < 0.1% |
| 76 | 95 | |
| 96 | 1 | < 0.1% |
| 100 | 21 | 0.1% |
| Value | Count | Frequency (%) |
| 10 | 479 | |
| 12 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 25 | 34 | 0.3% |
| 35 | 75 | 0.7% |
| 38 | 33 | 0.3% |
| 43 | 5 | < 0.1% |
| 51 | 42 | 0.4% |
| 64 | 15 | 0.1% |
| 76 | 187 | 1.8% |
chf_exp
Real number (ℝ)
| Dataset A | Dataset B | |
|---|---|---|
| Distinct | 109 | 109 |
| Distinct (%) | 0.5% | 1.0% |
| Missing | 0 | 0 |
| Missing (%) | 0.0% | 0.0% |
| Infinite | 0 | 0 |
| Infinite (%) | 0.0% | 0.0% |
| Mean | 3.809129 | 3.7722324 |
| Dataset A | Dataset B | |
|---|---|---|
| Minimum | 0.8 | 0.8 |
| Maximum | 19.3 | 19.3 |
| Zeros | 0 | 0 |
| Zeros (%) | 0.0% | 0.0% |
| Negative | 0 | 0 |
| Negative (%) | 0.0% | 0.0% |
| Memory size | 331.7 KiB | 162.7 KiB |
Quantile statistics
| Dataset A | Dataset B | |
|---|---|---|
| Minimum | 0.8 | 0.8 |
| 5-th percentile | 1.6 | 1.6 |
| Q1 | 2.4 | 2.4 |
| median | 3.4 | 3.4 |
| Q3 | 4.7 | 4.6 |
| 95-th percentile | 7.5 | 7.5 |
| Maximum | 19.3 | 19.3 |
| Range | 18.5 | 18.5 |
| Interquartile range (IQR) | 2.3 | 2.2 |
Descriptive statistics
| Dataset A | Dataset B | |
|---|---|---|
| Standard deviation | 1.988009 | 1.9756402 |
| Coefficient of variation (CV) | 0.52190644 | 0.52373238 |
| Kurtosis | 6.0549246 | 5.7642107 |
| Mean | 3.809129 | 3.7722324 |
| Median Absolute Deviation (MAD) | 1.1 | 1.1 |
| Skewness | 1.8439524 | 1.8129017 |
| Sum | 80864 | 39287.8 |
| Variance | 3.9521797 | 3.9031544 |
| Monotonicity | Not monotonic | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 2.3 | 861 | 4.1% |
| 2.2 | 752 | 3.5% |
| 2.5 | 736 | 3.5% |
| 2.1 | 667 | 3.1% |
| 3.6 | 664 | 3.1% |
| 2.6 | 639 | 3.0% |
| 3.2 | 614 | 2.9% |
| 2.7 | 610 | 2.9% |
| 2 | 606 | 2.9% |
| 1.8 | 576 | 2.7% |
| Other values (99) | 14504 |
| Value | Count | Frequency (%) |
| 2.5 | 408 | 3.9% |
| 2.3 | 399 | 3.8% |
| 2.2 | 382 | 3.7% |
| 3.6 | 322 | 3.1% |
| 2.7 | 320 | 3.1% |
| 2.1 | 313 | 3.0% |
| 2.6 | 311 | 3.0% |
| 1.8 | 296 | 2.8% |
| 2 | 294 | 2.8% |
| 3.2 | 292 | 2.8% |
| Other values (99) | 7078 |
| Value | Count | Frequency (%) |
| 0.8 | 12 | 0.1% |
| 0.9 | 57 | 0.3% |
| 1 | 88 | 0.4% |
| 1.1 | 140 | |
| 1.2 | 107 | 0.5% |
| 1.3 | 140 | |
| 1.4 | 173 | |
| 1.5 | 243 | |
| 1.6 | 272 | |
| 1.7 | 154 |
| Value | Count | Frequency (%) |
| 0.8 | 4 | < 0.1% |
| 0.9 | 38 | 0.4% |
| 1 | 48 | 0.5% |
| 1.1 | 66 | 0.6% |
| 1.2 | 54 | 0.5% |
| 1.3 | 78 | |
| 1.4 | 105 | |
| 1.5 | 100 | |
| 1.6 | 170 | |
| 1.7 | 89 |
| Value | Count | Frequency (%) |
| 0.8 | 4 | < 0.1% |
| 0.9 | 38 | 0.2% |
| 1 | 48 | 0.2% |
| 1.1 | 66 | 0.3% |
| 1.2 | 54 | 0.3% |
| 1.3 | 78 | |
| 1.4 | 105 | |
| 1.5 | 100 | |
| 1.6 | 170 | |
| 1.7 | 89 |
| Value | Count | Frequency (%) |
| 0.8 | 12 | 0.1% |
| 0.9 | 57 | 0.5% |
| 1 | 88 | 0.8% |
| 1.1 | 140 | |
| 1.2 | 107 | 1.0% |
| 1.3 | 140 | |
| 1.4 | 173 | |
| 1.5 | 243 | |
| 1.6 | 272 | |
| 1.7 | 154 |
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
Dataset B
Dataset A
| pressure | mass_flux | d_e | d_h | length | chf_exp | author | geometry | |
|---|---|---|---|---|---|---|---|---|
| pressure | 1.000 | -0.227 | -0.579 | -0.544 | -0.152 | -0.314 | 0.370 | 0.588 |
| mass_flux | -0.227 | 1.000 | 0.006 | -0.079 | 0.059 | 0.352 | 0.191 | 0.219 |
| d_e | -0.579 | 0.006 | 1.000 | 0.806 | 0.407 | 0.066 | 0.342 | 0.465 |
| d_h | -0.544 | -0.079 | 0.806 | 1.000 | 0.625 | -0.086 | 0.631 | 0.902 |
| length | -0.152 | 0.059 | 0.407 | 0.625 | 1.000 | -0.283 | 0.431 | 0.511 |
| chf_exp | -0.314 | 0.352 | 0.066 | -0.086 | -0.283 | 1.000 | 0.152 | 0.176 |
| author | 0.370 | 0.191 | 0.342 | 0.631 | 0.431 | 0.152 | 1.000 | 0.962 |
| geometry | 0.588 | 0.219 | 0.465 | 0.902 | 0.511 | 0.176 | 0.962 | 1.000 |
Dataset B
| pressure | mass_flux | d_e | d_h | length | chf_exp | author | geometry | |
|---|---|---|---|---|---|---|---|---|
| pressure | 1.000 | -0.245 | -0.575 | -0.529 | -0.132 | -0.338 | 0.366 | 0.568 |
| mass_flux | -0.245 | 1.000 | 0.012 | -0.066 | 0.080 | 0.334 | 0.190 | 0.214 |
| d_e | -0.575 | 0.012 | 1.000 | 0.820 | 0.409 | 0.056 | 0.370 | 0.459 |
| d_h | -0.529 | -0.066 | 0.820 | 1.000 | 0.621 | -0.079 | 0.703 | 0.896 |
| length | -0.132 | 0.080 | 0.409 | 0.621 | 1.000 | -0.285 | 0.426 | 0.491 |
| chf_exp | -0.338 | 0.334 | 0.056 | -0.079 | -0.285 | 1.000 | 0.149 | 0.160 |
| author | 0.366 | 0.190 | 0.370 | 0.703 | 0.426 | 0.149 | 1.000 | 0.961 |
| geometry | 0.568 | 0.214 | 0.459 | 0.896 | 0.491 | 0.160 | 0.961 | 1.000 |
Dataset A
A simple visualization of nullity by column.
Dataset B
A simple visualization of nullity by column.
Dataset A
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
Dataset B
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
Dataset A
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
Dataset B
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
Dataset A
| author | geometry | pressure | mass_flux | d_e | d_h | length | chf_exp | |
|---|---|---|---|---|---|---|---|---|
| id | ||||||||
| 0 | Thompson | tube | 7.00 | 3770.0 | NaN | 10.8 | 432.0 | 3.6 |
| 1 | Thompson | tube | NaN | 6049.0 | 10.3 | 10.3 | 762.0 | 6.2 |
| 2 | Thompson | NaN | 13.79 | 2034.0 | 7.7 | 7.7 | 457.0 | 2.5 |
| 3 | Beus | annulus | 13.79 | 3679.0 | 5.6 | 15.2 | 2134.0 | 3.0 |
| 5 | NaN | NaN | 17.24 | 3648.0 | NaN | 1.9 | 696.0 | 3.6 |
| 6 | Thompson | NaN | 6.89 | 549.0 | 12.8 | 12.8 | 1930.0 | 2.6 |
| 8 | NaN | tube | 12.07 | 4042.0 | NaN | NaN | 152.0 | 5.6 |
| 9 | Peskov | tube | 12.00 | 1617.0 | 10.0 | 10.0 | 520.0 | 2.2 |
| 11 | Janssen | annulus | 4.13 | 1519.0 | 12.7 | NaN | 1778.0 | 4.6 |
| 13 | Peskov | tube | 12.00 | 2794.0 | 10.0 | NaN | 1650.0 | 2.9 |
Dataset B
| author | geometry | pressure | mass_flux | d_e | d_h | length | chf_exp | |
|---|---|---|---|---|---|---|---|---|
| id | ||||||||
| 4 | NaN | tube | 13.79 | 686.0 | 11.1 | 11.1 | 457.0 | 2.8 |
| 7 | Peskov | tube | 18.00 | 750.0 | 10.0 | 10.0 | 1650.0 | 2.2 |
| 10 | Thompson | tube | NaN | NaN | 1.9 | 1.9 | 152.0 | 3.2 |
| 12 | Thompson | NaN | 6.89 | 7500.0 | NaN | 12.8 | 1930.0 | 4.8 |
| 23 | Beus | annulus | 15.51 | 1355.0 | 5.6 | 15.2 | 2134.0 | 2.1 |
| 27 | Weatherhead | NaN | 13.79 | NaN | 11.1 | 11.1 | 457.0 | 3.5 |
| 34 | Thompson | tube | 13.79 | 1275.0 | NaN | 7.8 | 591.0 | 2.4 |
| 36 | Thompson | tube | 13.79 | 1655.0 | 7.7 | 7.7 | 457.0 | 3.8 |
| 39 | Thompson | tube | 13.79 | 5588.0 | 5.7 | 5.7 | 625.0 | 3.3 |
| 43 | Thompson | tube | 18.96 | 2699.0 | 1.9 | 1.9 | 696.0 | 2.2 |
Dataset A
| author | geometry | pressure | mass_flux | d_e | d_h | length | chf_exp | |
|---|---|---|---|---|---|---|---|---|
| id | ||||||||
| 31627 | Thompson | tube | 0.64 | 3282.0 | 3.0 | 3.0 | 100.0 | 7.1 |
| 31628 | Thompson | tube | 13.79 | 1302.0 | 4.7 | 4.7 | NaN | 2.5 |
| 31630 | Janssen | annulus | 6.89 | 2807.0 | 6.4 | NaN | 914.0 | 4.5 |
| 31631 | Thompson | tube | 3.86 | NaN | 10.8 | 10.8 | 432.0 | 4.1 |
| 31635 | Thompson | tube | 17.24 | 2984.0 | 1.9 | 1.9 | 152.0 | 3.9 |
| 31636 | NaN | NaN | 12.07 | NaN | NaN | 1.9 | 152.0 | 5.4 |
| 31638 | Thompson | tube | NaN | 3648.0 | 4.7 | 4.7 | 318.0 | 9.0 |
| 31639 | Thompson | NaN | NaN | 1736.0 | NaN | 7.8 | 591.0 | 2.3 |
| 31641 | Thompson | NaN | 18.27 | 658.0 | 3.0 | 3.0 | 150.0 | 2.3 |
| 31643 | NaN | tube | 6.89 | 7568.0 | 12.8 | 12.8 | 1930.0 | 3.3 |
Dataset B
| author | geometry | pressure | mass_flux | d_e | d_h | length | chf_exp | |
|---|---|---|---|---|---|---|---|---|
| id | ||||||||
| 31620 | Thompson | NaN | 15.51 | 3024.0 | 1.9 | 1.9 | NaN | 6.4 |
| 31621 | Thompson | tube | 6.86 | 4062.0 | 10.8 | NaN | 1727.0 | 4.2 |
| 31625 | Thompson | tube | NaN | 3637.0 | 4.6 | 4.6 | 229.0 | 12.8 |
| 31629 | Thompson | NaN | 13.79 | 4964.0 | NaN | 4.7 | 318.0 | 3.9 |
| 31632 | Thompson | tube | 18.27 | 833.0 | NaN | NaN | 150.0 | 4.1 |
| 31633 | Thompson | tube | 11.03 | NaN | 11.5 | 11.5 | NaN | 2.0 |
| 31634 | Richenderfer | plate | 1.01 | 2000.0 | 15.0 | 120.0 | 10.0 | 6.2 |
| 31637 | Weatherhead | tube | 13.79 | 688.0 | NaN | 11.1 | 457.0 | 2.3 |
| 31640 | NaN | NaN | 13.79 | NaN | 4.7 | 4.7 | NaN | 3.9 |
| 31642 | Thompson | tube | 6.89 | 3825.0 | 23.6 | 23.6 | 1972.0 | 3.7 |
Dataset A
| author | geometry | pressure | mass_flux | d_e | d_h | length | chf_exp | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|
| 356 | Thompson | tube | 13.79 | 1233.0 | 7.8 | 7.8 | 591.0 | 2.3 | 9 |
| 531 | Thompson | tube | 13.79 | NaN | 7.8 | 7.8 | 591.0 | 2.6 | 9 |
| 529 | Thompson | tube | 13.79 | NaN | 7.8 | 7.8 | 591.0 | 2.3 | 8 |
| 514 | Thompson | tube | 13.79 | NaN | 7.7 | 7.7 | 457.0 | 2.3 | 7 |
| 520 | Thompson | tube | 13.79 | NaN | 7.7 | 7.7 | 457.0 | 3.6 | 6 |
| 449 | Thompson | tube | 13.79 | 3648.0 | 4.7 | 4.7 | 318.0 | 3.2 | 5 |
| 527 | Thompson | tube | 13.79 | NaN | 7.8 | 7.8 | 591.0 | 2.0 | 5 |
| 685 | Weatherhead | tube | 13.79 | NaN | 7.7 | 7.7 | 457.0 | 2.6 | 5 |
| 4 | Beus | annulus | 11.03 | 1355.0 | 5.6 | 15.2 | 2134.0 | 2.1 | 4 |
| 27 | Beus | annulus | 13.79 | NaN | 5.6 | 15.2 | 2134.0 | 2.1 | 4 |
Dataset B
| author | geometry | pressure | mass_flux | d_e | d_h | length | chf_exp | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|
| 118 | Thompson | tube | 13.79 | 1465.0 | 7.8 | 7.8 | 591.0 | 2.5 | 4 |
| 164 | Thompson | tube | 13.79 | NaN | 7.8 | 7.8 | 591.0 | 2.9 | 4 |
| 11 | Beus | annulus | 15.51 | NaN | 5.6 | 15.2 | 2134.0 | 1.6 | 3 |
| 16 | Beus | annulus | NaN | NaN | 5.6 | 15.2 | 2134.0 | 2.1 | 3 |
| 29 | Janssen | annulus | 6.89 | NaN | 8.5 | 22.3 | 2743.0 | 2.2 | 3 |
| 46 | Thompson | tube | 3.45 | 5696.0 | 10.3 | 10.3 | 762.0 | 4.4 | 3 |
| 76 | Thompson | tube | 6.89 | NaN | 37.5 | 37.5 | 1953.0 | 2.0 | 3 |
| 115 | Thompson | tube | 13.79 | 1370.0 | 7.7 | 7.7 | 457.0 | 4.5 | 3 |
| 124 | Thompson | tube | 13.79 | 1736.0 | 7.8 | 7.8 | 591.0 | 2.1 | 3 |
| 144 | Thompson | tube | 13.79 | NaN | 4.7 | 4.7 | 318.0 | 2.7 | 3 |